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Abstract 

F0G1 is a transcriptional regulator that acts in concert with the hematopoietic master regulator GATAl to coordinate the 
differentiation of platelets and erythrocytes. Despite considerable effort, however, the mechanisms through which FOGl 
regulates gene expression are only partially understood. Here we report the discovery of a previously unrecognized domain 
in FOGl: a PR (PRD-BFl and RIZ) domain that is distantly related in sequence to the SET domains that are found in many 
histone methyltransferases. We have used NMR spectroscopy to determine the solution structure of this domain, revealing 
that the domain shares close structural similarity with SET domains. Titration with S-adenosyl-L-homocysteine, the cofactor 
product synonymous with SET domain methyltransferase activity, indicated that the FOG PR domain is not, however, likely 
to function as a methyltransferase in the same fashion. We also sought to define the function of this domain using both 
pulldown experiments and gel shift assays. However, neither pulldowns from mammalian nuclear extracts nor yeast two- 
hybrid assays reproducibly revealed binding partners, and we were unable to detect nucleic-acid-binding activity in this 
domain using our high-diversity Pentaprobe oligonucleotides. Overall, our data demonstrate that FOGl is a member of the 
PRDIVl (PR domain containing proteins, with zinc fingers) family of transcriptional regulators. The function of many PR 
domains, however, remains somewhat enigmatic for the time being. 
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Introduction 

The activity of the transcription factor GATAl in erythroid 
development is modulated by a range of coregulators, including 
Friend of GATA 1 (FOGl). FOGl is a nine-zinc-finger protein 
(Figure lA) that is essential for proper differentiation and 
maturation of both megakaryocytes and erythroid precursors [1]. 
FOGl knockout mice die at ElO.5-11.5 due to severe anaemia 
with arrest in erythroid development, a phenotype that is related 
to that observed in GATAl knockout mice [2]. FOGl and 
GATAl interact both functionally [3] and physically [4], and 
disruption of the normal interaction of FOGl and GATAl has 
been linked to a range of inherited blood disorders (reviewed in 
[5]). 

Despite FOGl containing nine classical zinc-fmger domains, 
there is no evidence to date that the protein binds directly to 
nucleic acids, suggesting that FOG 1 most likely regulates GATA 1 
activity by recruiting co-regulator complexes. FOGl is required 
for both the activation and the repression of most GATAl target 
genes [6-9]. FOG-mediated repression of GATAl in transient 
transfection assays and ectopic expression both depend on its 



ability to recruit the co-repressor C-terminal binding protein 
(CtBP) [10-12], via a PXDL motif between zinc fingers 6 and 7. 
However, FOGl does not appear to require its major PXDLS 
CtBP-binding motif during erythropoiesis since mice carrying a 
FOGl mutant with reduced CtBP binding develop normally [13], 
suggesting FOGl recruits another repressor complex during 
erythropoiesis. 

The N-terminus of FOGl appears to be particularly important 
for its function. During megakaryopoiesis, deletion mutants 
lacking residues 1-144 could at least partially rescue erythroid 
but not megakaryocyte maturation, suggesting a lineage specific 
role for the N-terminal region [13]. Subsequently, residues 1-12 of 
FOGl were shown to be able to mediate transcriptional repression 
by GATAl [14], via recruitment of the nucleosome remodeling 
and deacetylation (NuRD) complex [15-17]. Similarly, the N- 
terminal region of FOG2 represses GATA-4 activity [17], 
although the possibility remains open that other regions might 
contribute to repression. 

As part of an effort to understand the molecular mechanisms 
through which FOGl regulates gene expression during hemato- 
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poietic development, we analyzed the amino acid sequence of 
murine FOGl (Uniprot: 035615). PONDR (http://www.pondr. 
com/), a program that predicts the distribution of structured and 
natively disordered regions in a protein sequence, predicted that 
part of the region P100-V254 of FOGl is likely to be well- 
ordered. Sequence comparisons reveal similarity of up to ~30% to 
the PR (PRDI-BFl and RIZ homology) domains found in the 
human proteins PRDMl-17 (Figure IB). The relatively low 
degree of similarity means that programs such as Interpro (http:/ / 
www.ebi.ac.uk/interpro/) do not reveal any domains in FOGl 
other than the nine well-characterized classical zinc fingers 
indicated in Figure lA. 

PRDM-famUy proteins are gene regulatory proteins that are 
found in metazoans, but not plants or fungi. Seventeen such 
proteins have been defined in primates, whereas only two are 
found in the sea squirt Ciona intestinalis, indicating a substantial 
expansion during vertebrate evolution. Their biological roles are 
stiU not well understood in many cases, but a number of family 
members appear to act in stem cells and in cellular differentiation 
(reviewed in [18]). PRDM14 is important in stem cell biology and 
epigenetic reprogramming (reviewed in [19]), PRDM3 is required 
for the integrity of heterochromatin [20] and PRDM16 is essential 



for maintaining adipocyte identity [21]. Not surprisingly therefore, 
dysregulation of PRDM activity has been associated with several 
different types of cancer [22-24] . 

All 1 7 human proteins contain an N-terminal PR domain and 
all but PRDMl 1 contain an array of between four and fifteen 
classical zinc fingers clustered in a range of different patterns 
[18,25]. The PR domain bears structural similarity to the catalytic 
SET domains (named for the Drosophila proteins Suppressor of 
variegation 3-9, Enhancer of zeste and Trithorax) found in histone 
lysine methyltransferases [26], although in general many of the 
residues associated with catalytic activity in the SET domains are 
not conserved in PR domains. Despite the absence of these 
residues, however, methyltransferase activity has been observed in 
at a number of PRDM proteins (PRDM2, -3, -6, -8, -9 and -16) 
[20,27-31]. Members of the family have also been demonstrated 
to act as sequence-specific DNA-binding proteins (most likely 
through their zinc-finger domains) or as protein-recruitment 
agents at gene regulatory elements, and at this stage a clear 
consensus view of the function of these proteins as a class has not 
yet emerged. 



NuRD 



imaininiiiB 



CtBP 



995 



PRDm/BLIMPI 184-201 
PRDM2/RIZI28-141 
PRDM3/EVI1 178-1 90 . 
PRDM4I41 2-529 
PRDM5I8-124 
PRDM6I246-365 
PRDM7I244-3S8 
PRDMel16-131 
PRDM9I244-358 
PRDMl 01208-326 
PRDMl 111 49-260 
PRDM12I86-203 
PRDM13I1-112 
PRDMl 4I2S1 -367 . . 
PRDMl 5/ZNF298I41 5-525 
PRDM16I82-211 
PRDM17I58-172 . P 
hFOG1l86-203 P WS 

XF0G1I1-11S . WN 

mFOG1j100-205 P WS 



RNWL FK 




M'SK EY 

A T K P 
T K R K 
TG ET 
T A RR 

C AAQR 
E AS □ 
T CD 
E AS D 
K RR . 
E V . . 
_ K TW . 
S AD CC . 
C SS F . 
A I T Q . 
A K R K. 

C VG DP 

OR R I R A RL S 
H I R A RN Q 
QRBv R a RL S 



.R GEBHl 



.Y YP NL G 
. I LD EF YN 



.G SK GE 
. . D GT 
. . . .R 
. . .1 F R 



WMB 
KF 
E F C| 

L V 
QH F 

Y E > 
D T 8 

Y E 



F R 1 
.R t\0 

GD. . . .RKERDLHED LW FE 

N N R YK 

DGTCJRYFl 

M V R GE LVDl; 

DGHBSHFL~ 

G HPV C fE 

VE(10)ISEDLGSEKF 

L S L G PW6_DV 

.Q A 

P 

R 



I A EHJ S PAD T 
' ADBN I SRS 

I eeBg paDt 



G F N EE K 
ATD PEK 
A SQ PD V 
T T D E N EC 
ATN PR H 
G GE P S K 
G K D KS SA 
NG SS EG 
G K D K S WA| 
L SD ET L 
GSD ET K 
A SQ ED H 
S GG SP . 
G K GG . T 
TSN EDD 
ANQAG A 
G A CE QS 
L L LV D E 
.WDVDG 
. L MV D E 




:R X X N H X C ' 



Figure 1. Domain structure and sequence analysis of FOGl. (A) Domain structure of murine FOGl. C2HC type zinc fingers are light grey; C2H2 
fingers are unshaded. The binding sites for CtBP and the NuRD complex are indicated, as is the newly identified PR domain. (B) Sequence alignment 
of murine F0G(1 00-205) with human and Xenopus laevis FOGl and with all human PR domains. Colouring indicates conservation at four different 
levels. The essential catalytic consensus motif found in SET domains is shown below the alignment and indicated with a dashed box. Secondary 
structure elements in FOG-PR are indicated below the alignment. Alignment was carried out using CLUSTAL OMEGA [49] and the diagram made 
using ALINE [50]. 

doi:1 0.1 371/journal.pone.01 0601 1 .gOOl 
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Determination of the solution structure of F0G1-PR 

To characterize this predicted domain in FOGl, we overex- 
pressed (as a GST-fusion protein in Escherichia coli) a p()l)'peptide 
corresponding to residues Pl()()-V254 of murine FOGl (hereafter 
referred to as FOG-PR) and then purified it using GSH-affinity 
and, following removal of the GST by cleavage with thrombin, 
size exclusion chromatography, Far-UV circular dichroism and 
one-dimensional 'H NMR spectra (not shown) revealed that this 
polypeptide contained substantial P-sheet secondary structure and 
took up a well-defined conformation in solution. Size exclusion 
chromatography with in-line multi-angle laser light scattering 
(MALLS) gave a mass estimate of ~17.2 kDa, indicating that 
FOG-PR (MWtheor. = 17.0 kDa) is monomeric in solution and 
suggesting that FOG-PR should be a suitable candidate for 
structural analysis by NMR spectroscopy. Accordingly, the ' 'N- 
HSQC spectrum of FOG-PR contains approximately the expect- 
ed number of signals for a 150-residue protein and displays 
hnewidths and chemical shift dispersion consistent with a folded 
monomer (Figure 2). 

We went on to determine the solution structure of FOG-PR 
using multidimensional heteronuclear NMR methods. Assign- 
ments were made for ~95% of expected backbone 'H, ''^C and 
'■^N nuclei, and ~85% of side chain 'H, ' ^C nuclei in the region 
P100-E207. Littie or no data were observed for the residues E147 
and El 48, and assignments could only be made with confidence 
for a small number of residues in the C-terminal region (P208- 
V254). Approximately 25 signals in the '-^N-HSQC therefore 
remained unassigned, but nearly all of these had rather narrow 
Unewidths, chemical shifts in the range ~7.8-8.5 ppm and few 
NOEs in a '^N-edited NOESY spectrum. Taken together, these 
observations indicate that the C-terminal part of the polypeptide is 
disordered. The results of limited proteolysis carried out on 
FOGl (100-2.54) were also consistent with this conclusion, 
identifying a major proteolysis product corresponding to 
FOG1(100-214). 

'^N-edited, '^C-edited and 2D NOESY spectra were peak- 
picked and CYANA 3.0 [32] was used to assign NOEs and 
calculate structures. The 50 lowest energy structures, calculated 



with 1244 distance restraints and TALOS-l-derived dihedral angle 
restraints for 89 residues, were refined in explicit water using CNS, 
according to the RECOORD protocol [33]. The 20 lowest energy 
structures were used to represent the structure of FOG-PR 
(Figure 3 and Table 1). This family of structures has a backbone 
RMSD (over residues with cp and v|/ angle order parameters of > 
0.95) of 0.67 A, and exhibits no NOE violations >0.5 A. The well- 
ordered region of the protein was defined as ranging from G103- 
1137 and D158-V205; the sequence Q.138-V157 and the 
sequences N- and C-terminal to the ordered region exhibited 
random-coil chemical shifts. ga\'e rise to no non-sequential NOEs, 
and had very low backbone angle order parameters in the final 
structures. PROCHECK_NMR analysis showed that, on a 
Ramachandran plot, 99.8% of residues in the well-ordered region 
of the protein fall within the most favoured or additionally allowed 
regions (calculated for non-Pro, non-Gly residues). '^N Ti, T2 and 
heteronuclear NOE data were consistent with this arrangement 
(Figure 4). 

Comparison of the structure with SET and other PR 

domains 

The region encompassing the PR domain (P100-V205) of 
murine FOGl has ~90% and ~67% sequence similarity with the 
corresponding regions of human FOGl and Xenopus laevis 
FOGl, respectively, indicating that this structure is conserved 
across all FOGl homologues. It is also clearly conserved in FOG2, 
although does not appear to be present in the related Drosophila 
prot(;in U-sliapcd. Overall, the fold closely resembles that of the 
enzymatic SET domains found in many lysine methyltransferases 
[34]. Examination of the >10 structures of SET domains 
determined to date shows that the domain consists of a number 
of semi-modular subunits. The core SET domain appears to 
comprise N- and C-tcrminal regions (SET-N and SET-C) that are 
relati\'ely invariant l)etwe('n different domains, together with a 
central SET-I region that has widely var^'ing length and structure 
[35]. In addition, flanking domains (pre-SET and post-SET 
domains) are generally observed, which are also somewhat 
variable in nature. 




'H chemical shift (ppm) 



Figure 2. ''^N-HSQC spectrum of FOGl. The spectrum was recorded on 0.5 mM FOG-PR at 298 K on a 600-MHz NMR spectrometer. The central 
portion of the spectrum (boxed) is labeled with assignments In the expansion. Signals with no label are unassigned. 
doi:1 0.1 371/journal.pone.01 0601 1 .g002 
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Figure 3. NWIR solution structure of FOG-PR. (a) Overlay of bacl<bone traces of the 20 lowest energy structures following RECOORD refinement. 
NMR data indicated that residues 100-102 and 207-254 are disordered, and these are not shown, (b) Ribbon diagram of the lowest energy structure 
of FOG-PR. The N/C-termini, residue numbers and the position of the pseudoknot are indicated, (c) Comparison of the DIMS SET domain structure 
(PDB: 1PEG [36], left) with FOG-PR (right). Corresponding elements of structure are coloured similarly in a pattern ranging from red to orange to 
yellow to green to blue to purple (N- to C-terminal end). The regions in gray are a small pre-SET domain and the SET-I variable region that lies 
between the yellow and green regions. The pseudoknot is apparent as the purple C-terminal )3-strand that passes through the green helix/loop. Zinc 
ions are shown as grey spheres, (d) Comparison of the PRDIVI4 PR domain structure (left, PDB 3DB5) with FOG-PR (right). Corresponding elements of 
structure are coloured as in part (b). Diagrams were produced using the programs PYMOL and IVIOLMOL. 
doi:l 0.1 371 /journal.pone.01 0601 1 .g003 



Figure 3C shows a comparison of the structure of FOG-PR 
and the core of the SET domain of DIM-5 [36] . The arrangement 
of secondary structural elements is clearly the same, down to the 
presence of the C-terminal pseudo-knot that is common to all SET 
domain structures solved to date (purple). The SET-N and SET-C 
regions clearly match those in DIM-5 . In contrast, SET-I is an OH- 
P structure in DIM-5 (yellow), whereas in FOG this region simply 
comprises a disordered 12-residue loop. The residues in this loop 
gave rise to broad (or no) signals in the NMR spectra, indicating 
that they participate in |.ls-ms timescale motion; it is possible that 
this region forms a marginally stable structure. DIM-5 also 
displays an elaborate pre-SET domain (grey) that binds three 



Zn(II) ions. A structure-guided alignment of the DIM-5 and FOG- 
PR sequences reveals that the most highly clustered set of 
conserved residues lies in the hydrophobic core that is common 
to the two structures. 

Structures have also been determined for the PR domains of 
PRDMl, 2, 4, 9, 10, 11 and 12 (although only structures of 
PRDM2 have been published [37,38]). These structures more 
closely resemble that of FOG-PR (Figure 3D shows a compar- 
ison of FOG-PR with the PRDM4 PR domain), with fewer 
elaborations than the SET domains. Notably, all seven structures 
of PR domains display a three-stranded P-sheet in the SET-I 
region that is disordered in FOG-PR, although some poorly 



PLOS ONE I www.plosone.org 



4 



August 2014 | Volume 9 | Issue 8 | e106011 



The Structure of a PR Domain in FOGl 




T 1 1 r 

100 120 140 160 180 200 220 240 
Residue 



Figure 4. relaxation data for FOG-PR. Bacl<bone '^N 7",, T2 and heteronuclear '^N-^H NOE values are plotted as a function of residue number. 
Error bars indicate the standard error from the curve fit for 7", and T2 values and the range taken from duplicate measurements for the '^N-^H NOE. 
doi:1 0.1 371 /journal.pone.01 0601 1 .g004 



defined electron density is observed in several PR structures, 
perhaps indicating the presence of some flexibility in this region. 

FOG-PR is unlikely to have methyltransferase activity 

SET domains possess histone methyltransferase activity towards 
specific lysine residues in histones tails, leading to positive and 
negative regulation of gene expression, depending on context. The 
transferred methyl group is derived from the cofactor S-adenyosyl- 
L-methionine (SAM), and the transfer reaction gives rise to the 
cofactor product S-adenyo.syl-L-homocysteine (AdoHcy). Several 
structures have been determined of SET domains in the presence 
of AdoHcy, allowing the identification of the conserved substrate- 
binding pocket of the protein (see, for example, [39,40]). To test 



whether or not FOG-PR might also act as a methyltransferase 
domain, we titrated AdoHcy into a solution of ' N-labeled FOG- 
PR and recorded '^N-HSQC spectra. No changes were observed 
following the addition of up to 1 00 molar equivalents of AdoHcy 
(not shown), suggesting that FOG-PR is unlikely to act as a SAM 
dependent methyltransferase. 

This lack of binding is consistent with the absence of asparagine 
and cysteine residues that are highly conserved in the Ado-Met/ 
AdoHcy co-factor binding region of SET domains [35,40]. These 
residues form part of an H/RxxNHxC motif that is thought to be 
important for enzymatic activity. As noted above, methyltransfer- 
ase activity has been observed in at a number of PRDM proteins, 
suggesting that other residues might well be able to fulfil their 
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Table 1. Experimental restraints and structural statistics for FOG-PR. 





Experimental restraints 


Distance restraints 


1230 


Intraresidue {/,/) 


289 


Sequential 


366 


Medium-range (2< <4} 


124 


Long-range >4) 


451 


Total dihedral angle restraints 


(p 89 




89 


Pairwise RMS deviation from mean structure (residues 103-137, 158-205} 


Backbone atoms (N, C", C) 


0.67±0.13 A 


All heavy atoms (N, C, 0, S) 


1.13±0.15 


PROCHECK statistics 


Residues in most favoured region 


92.4% 


Residues in additional allowed regions 


7.4% 


Residues in generously allowed regions 


0.2% 


Residues in disallowed regions 


0% 


Deviation from idealized geometry (residues 100-226} 


Bond lengths 


0.017 A 


Bond angles 


1.4° 



doi:l 0.1 371/journal.pone.Ol 0601 1 .tOOl 



roles. In the case of PRDM9, a structure has been determined of 
the PR domain bound to both a histone H3-derived substrate 
peptide and AdoHcy [41]. It is notable that a substantial portion of 
the binding site for both molecules is derived from residues in 
either the SET-I region or in the sequence immediately C- 
terminal to the pseudo-knot. Both of these regions are disordered 
in the FOG-PR structure and in general the residues that make 
contacts with AdoHcy and histone H3 do not appear to be 
conserved in FOG-PR. 

Further efforts to pinpoint the function of FOG-PR 

If only some PR domains act as methyltransferases, the question 
arises as to what the function is of the remaining domains. Some 
PRDM proteins have been shown to associate with DNA and to 
recruit other proteins to chromatin [18], and it is therefore possible 
that PR domains can act as either DNA- or protein-binding 
modules. We used gel shift assays to assess the DNA-binding 
properties of FOG-PR. Previously we described Pentaprobes, a set 
of six high-diversity oligonucleotides that contain all possible five- 
base-pair sequences [42]. Data from our lab indicate that bona 
fide DNA-binding proteins will typically bind to Pentaprobes in a 
gel shift assay. However, we observed no binding of FOG-PR to 
any of the six double-stranded Pentaprobes. Similarly, no binding 
was observed to single-stranded forms of the Pentaprobes (data not 
shown). 

The PR domain of PRDM2 (RIZ) has been shown in GST- 
puUdowns to act as a protein interaction domain, mediating 
homodimerization [43]. We tested the protein-binding capability 
of FOG-PR by binding GST-FOG-PR to glutathione agarose 
beads and treating the beads with a nuclear extract from murine 
erythroleukemia (MEL) cells. SDS-PAGE analysis did not reveal 
any bands of significant intensity that were not observed in a GST- 
only control pulldown (data not shown). Furthermore, yeast two- 
hybrid screens carried out using FOG-PR as a bait and cDNA 



libraries from murine erythroleukemia (MEL) or K262 cells as 
prey did not yield any high-confidence hits (data not shown). 

Implications for PR domain function 

Although it is well accepted that the PR family of proteins acts 
to regulate gene expression, there is not yet a clear consensus on 
the biochemical mechanisms through which they achieve this 
outcome. Only a subset of the proteins have been demonstrated to 
display methyltransferase activity and, given the lack of clearly 
identified catalytic residues, it is possible that the observed activity 
arises (at least in some cases) from co-purified proteins. PR 
domains lacking catalytic activity might still function as interaction 
modules that recognize methylated histone tails or methylated 
sequences from other proteins. Such a binding activity could serve 
to modulate their function by either influencing their targeting to 
specific genomic loci or by regulating their binding to other 
protein partners that can be 'tagged' by lysine methylation. It is 
notable that a catalytically inactive SET domain has also been 
observed in the human protein SUVH9 [44] ; the function of this 
domain is also currentiy unresolved. 

DNA-binding activity has been reported for several family 
members, consistent with the presence of multiple classical zinc 
finger domains (reviewed in [18]). In contrast, the zinc fingers of 
FOGl do not appear to bind DNA (unpubhshed data). It is, 
however, known that zinc fingers in FOGl act as protein 
recruitment domains, binding to GATAl [3,4] and to TACC3 
[45] and it is possible that a subset of the zinc fingers in PRDM- 
family proteins (especially zinc fingers that are not part of a 
tandem repeat) likewise act to recognize protein partners. 

Other protein-recruitment motifs exist in PRDM-famUy pro- 
teins, including KRAB and AWS repressor domains. FOGl also 
harbors several domains that are associated with recruiting 
corepressors, including an N-terminal sequence that recruits the 
Nucleosome Remodeling and Deacetylase (NuRD) complex to 
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chromatin and a C-terminal Binding Protein (CtBP) binding motif 
[12] (Figure 1). Notably, PRDM2, 3 and 16 also contain CtBP 
binding motifs. 

In summary', our data show that FOGl contains a PR domain. 
The presence of this domain, together with other structural and 
functional similarities, defines FOG 1 as a new member of the PR- 
domain-containing protein family. This family of transcriptional 
regulators is likely to share a common mechanism of action and a 
broader elucidation of the biochemical function of the PR domain 
will illuminate the activity of the whole family. 

Materials and Methods 

Protein production 

GST-mFOGl(100-254) (FOG-PR) in pGEX-2T was overex- 

pressed in the BL21 strain of Escherichia coli (0.4 mM IPTG, 
37°C). For overexpression of '''N- or ' 'N,''^C-labeled FOG-PR, 
the protocol of Cai ei al. [46] was used, with induction occurring 
overnight at 25°C. Cell pellets were lysed by sonication (in 50 mM 
Tris pH 8.0, 50 mM NaCl, 1% Triton X-100, 1 mM PMSF, 
1 mM 6-mercaptoethanol, 0.1 mg/mL lysozyme), clarified by 
centrifugation and the supernatant bound to a glutathione 
Sepharose column at 4°C for 1-2 h. After washing (50 mM Tris, 
150 mM NaCl, 2.5 mM CaCl2), the fusion protein was either 
eluted with glutathione or cleaved with thrombin (37"C, 2-3 h). 
The eluted and concentrated protein (3-kDa cutoff Centricon) was 
further purified on a Superdex 200 16/60 or Superose 12 HR 10/ 
30 column (GE) in 50 mM sodium phosphate, 150 mM NaCl, 
1 mM DTT, pH 7.2. For NMR measurements, FOG(100-254) 
was prepared at 0.5-1.0 mM in a solution containing 20 mM 
Na2HP04/NaH2P04, pH 7.0 (5% ^HjO) and 2 ^M 5,5- 
dimethylsUapentanesulfonate (DSS). 

NMR spectroscopy and structure calculations 

Resonance assignments were made using a standard set of triple 
resonance experiments and NOE data were obtained from ''^C- 
NOESY-HSQC' (in >99% ^HjO) and ' "N-NOESY-HSQC 
spectra. All NMR spectra were recorded at 298 K on a Bruker 
Avance 600 MHz spectrometers, processed using TOPSPIN and 
analyzed using Sparky (T. D. Goddard and D. G. Kneller, 
SPARKY 3, University of California, San Francisco). '''N 
backbone relaxation experiments (Tj, T2 and heteronuclear 
NOE) were performed using standard Bruker pulse programs 
and were analyzed to extract relaxation rates using Sparky. 
Backbone cp and \|/ dihedral angle restraints were derived from the 
assigned backbone chemical shifts using TALOS+ [47]. Automat- 
ed NOE assignment and structure calculations were carried out 
using CYANA [32] and the lowest energy structures were refined 
using the RECOORD protocol [33]. The 20 conformers with the 
lowest energy were used to represent the solution structure of 
FOG-PR and deposited in the Protein Data Bank (PDB accession 
number 2mpl). Geometrical properties were assessed using 
PROCHECK_NMR [48]. 

Limited proteolysis 

Limited proteolysis was carried out by treating 100 |a,g 
FOGl (100-254) with 1 (J.g of chymotrypsin in a 20-|XL reaction 
volume for 4-10 min, separating the reaction mixture by SDS- 
PAGE and analyzing the major bands by peptide mass finger- 
printing on a MALDI mass spectrometer. 

Titration with AdoHcy 

AdoHcy in a matched buffer was added to '''N-labeled FOG- 
PR [in 20 mM Na2HP04/NaH2P04, pH 7.0 (5% ^H20) and 



2 |J,M DSS] giving final concentrations of up to 20 mM AdoHcy. 
'^N-HSQC spectra were recorded as above. 

Gel shift assays 

The double-stranded probes were end-labeled with ^^P accord- 
ing to standard procedures using polynucleotide kinase and 
purified on native polyacrylamide gels by standard methods 
[42]. Gel shift reactions were set up in a total volume of 30 |Xl, 
comprising approximately 1 pmol of '^^P labeled probe, ~100 ng 
of recombinant protein, 10 mM Hepes, pH 7.8, 50 mM KCl, 
5 mM MgCl2, 1 mM EDTA and 5% glycerol. After incubation on 
ice for 10 min, the samples were loaded onto a 6% native 
polyacr^'lamide gel made up in 0.5 x TBE. The gel was then 
subjected to electrophorc-sis at 1 5 V/ cm and 4°C for 3 h, dried, 
analysed and quantified when necessary using a Phospholmager 
(Molecular Dynamics). 

Pulldown assay 

Mouse erythroleukemia (MEL) cells were cultured in DMEM 
medium (+glucose, +glutamine, +pyruvate) supplemented with 5% 
FBS and 1% penicillin/streptomycin. 10-20 mL seed cultures 
were maintained in T75 flasks. For 1 L grow-ups, 1 mL of seed 
culture was added to 250 ml fresh medium in a T175 flask and 
grown at 37°C, 5% CO2 for 72 h to a density of ~ 1 x 1 0'^ cells/mL 
(viability >85%). Cells were harvested by centrifugation at 
2000 rpm for 5 min to yield 1-1.5 g (wet weight) ceUs/L culture. 
Cells were washed twice with PBS, then swollen in hypotonic 
solution (10 mM HEPES, 1.5 mM MgCl2, 10 mM KCl, pH 7.9) 
for 20 min and frozen in liquid nitrogen and stored at -80°C until 
use. Frozen and swollen cells were thawed at 37°C for 10 min and 
treated with IGEPAL (0.6% v/v) for 10 min. The mixture was 
centrifuged for 5 min at 2000 rpm to pellet nuclei and the 
cytoplasmic supernatant was discarded. The pellet was gentiy 
washed once with hypotonic solution (10 mM HEPES, 1.5 mM 
MgCl2, 10 mM KCl, pH 7.9, 0.6'% IGEPAL) and centrifuged 
again at 2000 rpm. Next buffer A (50 mM Tris, 150 mM NaCl, 
1% Triton X-100, 1 mM DTT, Complete protease inhibitors, 
pH 7.4) was added to the pellet (3 ml/g cells) and the mixture was 
sonicated on ice (step- tip, 1 0 x 1 s bursts with 1 0 s recovery, three 
times total) to give a milky white solution. The mixture was 
centrifuged at 13000 rpm for 10 min at 4°C. The clear nuclear 
extract was then used immediately. 

For the pulldown, GST-FOG-PR GSH beads (200 nL beads, 
0.5 mL beads per liter of £. coli lysate) was incubated with MEL 
nuclear extract (from 0.5 L MEL cell culture) overnight at 4°C. 
The beads were separated from the nuclear extract and washed 
three times with buffer A. Gel loading dye (EDS, 10 |J,L) was 
added to the wet beads and the sample heated for 10 min at 90°C. 
The mixture was then analysed by SDS PAGE. 

Protein Data Bank accession codes 

'H, '■^C and '^N backbone and sidechain chemical shift 
assignments have been deposited in the BioMagResBank with 

accession number 19988 and the structure coordinates have been 
deposited in the RCSB Protein Data Bank (accession code 2 mpl). 
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